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Abstract 

In this work we present a simple and efficient algorithm which, with high probability, 
provides an almost uniform sample from the set of proper fc-colourings on an instance of 
sparse random graphs G„ c;/„, where k = k{d) is a sufficiently large constant. Our algorithm 
is not based on the Markov Chain Monte Carlo method (M.C.M.C). Instead, we provide 
a novel proof of correctness of our Algorithm that is based on interesting "spatial mixing" 
properties of colourings of Gn.d/n- Our result improves upon previous results (based on 
M.C.M.C.) that required a number of colours growing unboundedly with n. 

1 Introduction. 

For a graph G = iV^E), a (proper) /c-colouring is an assignment a : V ^ [k] such that 
adjacent vertices receive different colours, where for some positive integer fc, [k] indicates the 
set {1, . . . , fc}. It is well known that it is NP-hard to estimate the minimum number of colours 
in a proper A;-colouring, i.e. estimate the chromatic number of G. However, in many cases there 
are estimates and upper bounds of the chromatic number e.g. if A is the maximum degree of G, 
then one can /c-colour G for k = A + 1. Furthermore, for special classes of graphs the chromatic 
number has been estimated very accurately, e.g. in [T], Achlioptas and Naor have found the 
two possible values of the chromatic number that an instance of a sparse random graph has 
with high probability, i.e. with probability that tends to 1 as the size of the graph tends to 
infinity. All these raise the interesting computational challenge of finding the number of proper 
/c-colourings for k greater than the chromatic number. 

In [H], Valiant, introduced the notion of #P-hardness and proved that counting fc-colourings 
is #P-complete. The existence of a polynomial-time algorithm for exact counting is considered 
highly unlikely. Thus, we focus on designing polynomial-time algorithms for approximate count- 
ing. Practically, the closer k gets to the chromatic number of G, the more difficult it becomes to 
estimate the number of its fc-colourings. By [7], [8] we can reduce the estimation of the number 
of fc-colourings of G to sampling almost uniformly from the set of all its proper fc-colourings. 
By "almost" we mean with distribution close, in some sense, to the uniform distribution. 

In this work, we use focus on sampling fc-colourings of instances of a sparse random graph, 
i.e. random graphs with vertices having expected degree equal to some constant, d, and fc = k{d) 
is a sufficiently large constant which scales as d}^, for sufficiently large d. 
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Definition A Let n be a positive integer and p, < p < 1- The random graph Gn,p is a 
probability space over the set of graphs on the vertex set {1, . . . ,n} determined by 

P'f'ilhj} is an edge of G] =p 
with these events being mutually independent. 

For a sparse random graph the parameter p is of the form p = d/n, where d is a positive 
real constant. We take d > 1 (otherwise the problem of sampling is trivial). 

The mathematical tool that we use for studying the problem of sampling A;-colourings of 
instances of a sparse random graph is the "spin systems" and more specifically the proper colour- 
ing model, also refereed to as antiferromagnetic Potts model at zero temperature in statistical 
physics. 

Colouring Model on a finite graph. The colouring model on a finite graph G = (V, E) and 
set of colours [5], for some positive integer 5, is defined as follows. The system consists of a set 
of sites, which correspond to the vertices of G, and each site is assigned a spin, i.e. a member 
of [S]. A configuration is an assignment of spins to V. Not all configurations can occur in the 
colouring model. A configuration that may occur is called a feasible configuration. The set of 
feasible configurations is the set of proper iS-colourings of the underlying graph G. 

For any vertex set V C V, let dV = {v ^ V \V'\ 3u G V s.t. {u,v} E E}. Consider 
the colouring C{dV') S [5]^^ , which is such that, there is a proper colouring in [S]^ with 
the vertices in dV' coloured as C{dV'). In a system where the vertices in dV' are coloured as 
C{dV'), the colour assignments of the vertices in V' is distributed uniformly over, all the colour 
assignments of the vertices in V U dV that agree with C{dV') on the vertices in 0. 

Frequently, one imposes boundary conditions on the model, which corresponds to fixing the 
colour assignment at some ^^boundary" vertex set of G; we use the term "/ree boundar]/^ when 
there are no boundary conditions specified. In this work, when we consider a system with 
boundary conditions, we always assume that there exists at least one feasible configuration 
available for this system. 

The probability to find a colouring model at a specific configuration is the uniform distri- 
bution over all proper colourings of the underlying graph. Generally, the probability of finding 
a system in a specific configuration is given by its Gibbs measure specified by this system. 

Definition B For a finite graph G = {V,E) and an integer S, let PCS{G,S,G{L)) be a 
colourings model with underlying graph G, feasible configurations all the proper S-colourings 
of G and the boundary L V is coloured as C{L). Let Q{G,S,C{L)) be the set of feasible 
configurations of the system. 

The omission of the boundary conditions parameter implies free boundary. If the first para- 
meter is a class of random graphs, e.g. Gn,p, then we consider that the underlying graph is an 
instance of this class. 

Clearly, S7(G,5,C(L)) is the set of all proper 5-colourings of G that have the vertices in 
the set L C y coloured as specified by the assignment C{L). For a system PCS{G,S,C{L)) 
we always assume that the boundary C{L) is such that ^1{G,S,C{L)) ^ 0. 

For convenience, we use the following notation rules throughout this work. In PCS{G = 
(y, £"), iS, C(A)), for A C y the colour assignment of each vertex v ^V, oi the set of vertices 
V' C V are considered to be equal to the random variables X^^ G [S] and Xyf" G [5]^', 
corr esp ondingly. 

rigorous definition of a coiouring modei invoives tfie definition of a set of functions, tfie compatibility 
functions (see [16]). However, tfie definition we give iiere is a direct consequence of tliat witli tfie compatibiiity 
functions. 
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Definition C For the system PCS{G = {y,E),S), the function //(•) : 2['^]^ [0, 1] indicates 
the Gibbs measure specified by this system. 

In the system PCS{G = {V, E),S,C{L)), for € V, we denote with n{X^\C{L)), the 
marginal Gibbs measure of the random variable X^. 

1.1 Our work and related work. 

Previous work. The pioneering work of Dyer et.al., in [3], proposes a very interesting Markov 
Chain Monte Carlo (MCMC) based algorithm, which with high probability (w.h.p.), i.e. with 
probability that tends to 1 as the size of graph tends to infinity, provides an almost uniform 
sample from the set of proper colourings of G^.d/n which uses at least Q( io'gio^gfog„ ) colours. 
Noting that, w.h.p. the maximum degree of a sparse random graph is G(logn/loglogn), to 
our knowledge, this work was the first to present a procedure for sampling colourings that uses 
fewer colours than the maximum degree. 

In parallel and independently, E. Mossel and A. Sly, have recently derived es- 
sentially the same result as we do here, (i.e. a random sampling fc-colourings of a 
sparse random graph where is a constant) by using an MCMC approach, [12j. 
Our work. Our approach is quite different. It is based on showing the validity of a specific 
spatial mixing property for the system PGS{Gn^d/nj'S), he. we show that if 5 is greater than a 
specific value which depends only on the expected degree d, then an asymptotic independence 
between the colour assignment of any vertex v and the colour assignment of any subset of vertices 
which is at sufficiently large (graph) distance from v holds, in the system. We, then, present 
an algorithm which exploits this kind of asymptotic independence and produces, efficiently, 
an almost uniform sample of the 5-colourings of the underlying graph which uses a 

constant number of colours (where the constant is an increasing function of d). For an earlier 
version of our result, see 0. Our algorithm exploits ideas which are similar to those presented 
in [TT] and [15], for counting satisfiable truth assignments in a random A;-SAT formula and 
sampling independent sets of general graphs, correspondingly. However, our proof techniques 
are novel and some technical results are of independent interest. 

A possible schema of our algorithm is the following one. The input is the finite graph 
G = {V,E), an instance of Gn^d/m an integer S. We consider the system PGS{G,S) and 
the algorithm provides a sample which is distributed "close" to the Gibbs measure specified 
by this system. The algorithm assumes an arbitrary permutation of the vertices of the input 
graph, e.g. {vi,V2, ■ ■ ■ ,Vn), and in turn it assigns them a colour as follows: For 1 < i < n, 
let Ai CV he the set of i — 1 first coloured vertices, by the algorithm, and let C{Ai) be their 
colour assignment. Assume that the colour assignment C{Ai), of the vertices in Ai, is done 
according to a probability measure which is sufficiently close to n{XAi = C{Ai))), with /x(-) 
beeing the Gibbs measure that is specified by PGS{G,S). The algorithm computes efficiently 
a "good" estimation of //(A^. = s\C{Ai)), Vs € [S], and assign Vi a colouring according to this 
probability measure. The notion of "good" estimation of fi{X^- = s\C{Ai))., Vs € [5], implies 
that this estimation should be so accurate, that it will be possible for the algorithm to colour 
the remaining vertices with distribution as close to Gibbs measure of PGS{G,S) as we initially 
wanted. 

The spatial mixing property of PGS{Gn,d/m'S), for sufficiently large S, is exploited by our 
algorithm in conjunction with the structural property of G^^d/n stated in Lemma Rl 

Lemma A Let G = {V, E) be an instance of G^^d/nj where d > 1 is a fixed positive real. With 
high probability (w.h.p.) the graph has no vertex v with the following property: The induced 
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subgraph of G that contains v and all vertices within graph distance e log n from v, contains 
more than one cycle, for any real e > such that e < (41og(e^d/2))^^. 

The proof of Lemma [A] is given in section 12.11 

Showing an asymptotic independence between the colour assignment of any vertex v and 
the colour assignment of any vertex set at distance greater than elogn, for sufficiently small 
e = e{d), implies that when the algorithm has to assign a colour to the i-th vertex the following 
holds: If the colouring C{Ai) is done with probability measure which is sufficiently close to 
^{XAi = C(j4j)), then the algorithm can have a "good" estimation of ^(X„JC(j4j)) by just 
checking the colour assignments of a very simple structured neighborhood of Vi. The notion 
of "good" estimation is the same as the one stated previously. This kind of structure in the 
neighborhood of the vertex Vi is highly desirable since then, it allows us to get a colouring of Vi, 
which is distributed as this "good" estimation of ^(X„JC(j4j)), in time which is upper bounded 
by a polynomial of n. 

1.2 Further Definitions (Spatial Dependency) 

For the graph G = iV-iE) and any two vertex sets V , V" C ^ we denote by dist{V' , V") the 
graph distance of the two sets, i.e. the minimum length shortest path between all the pairs of 
vertices (^1,^2) G 1^' x V" . 

Definition D Let G = {V, E) he an instance of G^^d/n '^'^^ ^ ^ postive real. For the vertex 
V (z V, let Gy^d,i be the induced subgraph of G, which contains the vertex v and all the vertices 
within graph distance [/J from v. 

For a measure of comparison between probability measures we use the total variation distance. 

Definition E For measures fi and v on the same discrete space the total variation distance 
d'TvilJ','^) between fj, and u is defined as 

Definition F (Spatial Dependency.) Consider the graph G = {V,E), an instance ofGn^d/m 
the positive integers 5, / and the positive real s. For each v ^ V consider the subgraph G^^d,i 
with vertex set V^^i. For a given v € V, consider also any two S-colourings Ci(Vi) and G2{Vi) 
with Vi C and dist{{v},Vi) > I, having n{G^^i,S,Ci{Vi)), n{Gy^i,S,C2{Vi)) + 0. If for 
\/v &V it holds 

drv {fi{X,\Ci{Vi)),fi{X,\C2{Vi))) < s 

with /i(-) specified by the system PCS{Gy^d,h<S), then we say that " the distance / Spatial 
Dependency of the 5-colourings of G is s" . This will be denoted as\/v £V SD{v,l) = s. 

Clearly, the above definition extends, directly, to any system with underlying graph any type 
of graph. 

It is easy to see that if "Vf G V the distance I Spatial Dependency of the 5-colourings of 
G = (y, E) is s", then in the system PCS{G, S) and for each vertex v any two colourings 
C[{Vi) and C^{Vi) with Vi as defined in Definition E and C[ and having n{G, S, C[{Vi)) / 
and n{G,S,G!2{Vi)) / it holds 

drv {fi{X.,\Ci{Vi)),fi{Xy\C2{Vi))) < s 

with /x(-) specified, now, by the system PCS{G,S). The above holds since for each v ^ V, if 
ni = {C{Vi)\Q{Gy^i,S,C{Vi) ^ 0} and = {C{Vi)\Q{G,S,G{Vi) + 0}, then clearly ^ S^i- 
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1.3 Structure of the remaining paper. 

The remaining of our paper has the following structure. In section [2l we present two basic 
properties of the spin systems which we deal with. Then, it follows a detail description of our 
sampling algorithm in a form of pseudo code accompanied by a discussion and a statement, 
without proof, of two theorems which deal with the accuracy of the result that is returned and 
the execution time of the algorithm, corespondingly. 

The proofs and an analytic discussion about the properties of the spin-system that our 
algorithm considers, are given in section [3l This work ends by presenting the proofs of the 
theorems, that deal with the accuracy and the time efficiency of the our sampling algorithm, 
section |H 

For easiness of verification of our proofs, we provide here the dependence of each lemma and 
theorem on lemmas logically preceding it: Lemma lAl Lemma [Bland Lemma [Cl have no lemmas 
preceding them. Lemma [D] has predecessors Lemma O Lemma [B] and Lemma O Lemma [E] 
has predecessors LemmaEl Lemma iBj Lemma ICl and Lemma E. Lemma|F]has no predecessors. 
Lemma O has predecessors Lemma O and Lemma |Bj Lemma |H] has predecessors. Lemma [F] 
and Lemma iGl 

Theorem |A] has precedings all the lemmas of this paper. Theorem |B] has predecessors, 
Theorem El Lemma O Theorem ICj has predecessors. Theorem [A] Theorem IbI and Lemma 1X1 
Finally, Theorem [P] has no preceding. 



2 Statement of results. 

2.1 Properties of the spin system. 

In this section we state two crucial properties that the colouring model has with high probability, 
when the underlying graph is an instance of a sparse random graph. These properties are stated 
in the following lemma and theorem. The first one, already stated in section [H refers to the 
structure of the neighborhood of each vertex in an instance of a sparse random graph. The 
second one refers to a property of the colourings (configurations) of such a spin-system. I.e. in 
a PCS{Gn^d/ni '5)) if 5 is greater than a specific value which depends only on the expected degree 
d, then an asymptotic independence between the colour assignment of any vertex v and the colour 
assignment of any subset of vertices which is at graph distance, at least, 4iog('e2rf/2) loS'^J from 
V holds. 



Lemma A Let G = {V, E) he an instance of Gn^d/m where d > 1 is a fixed positive real. With 
high probability (w.h.p.) the graph has no vertex v with the following property: The induced 
subgraph of G that contains v and all vertices within graph distance elogn from v, contains 
more than one cycle, for any real e > such that e < (41og(e^d/2))~^. 

Proof: 

To show the lemma assume the contrary, i.e. there is some vertex v £ V whose corresponding 
graph Gy^(i,eiogn (see Definition [D]) contains two cycles, i.e. Ci and G2 each of length at most 
2elogn, the value of e will be determined later. The above assumption implies that there are 
two pairs of paths starting from v, such that: The paths in each pair do not have all their edges 
common and there is some vertex in Gy^ii,eiogn that can be reached from v by both paths of the 
pair. The existence of such two pairs of paths implies that in Gy^d,e\ogn there is a set of, at most 
4e log n, vertices which have among each other a number of edges which exceeds the number of 
vertices by one. 
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Thus, the proof of the lemma reduces to showing that in Gn^d/m there is no set of, at most, 
4elogn, vertices which contains a number of edges that exceed the number of vertices in the 
set by one, for sufficiently small e. Let D be the event "such a set exists". Setting r = 4elogn 
we have 



Pr[D] < 



< 




< > — 



< 



r-l 
k=0 




n deV2 - 1 



taking e such that 4e • log(de^/2) < 1 the r.h.s. of the last equation is o(l), as the nominator is 
o(n). Thus, for sufficiently small e we have that Pr[D] = o(l) o 



Theorem A Let G = {V^E) he an instance of Gn^d/nj where d> 1. If S is a sufficiently large 
integer, which depends on d, and e = ji^^^l0d/2)' ^hen w.h.p., i.e. with probability 1 — 2n~°'^^, for 
every vertex v £V the distance [elognj Spatial Dependency of the S -colourings of PGS{G,S) 
is n~^''^^ . For sufficiently large d, we should have S > d^^ . 

We mention that, if d is relatively small, then for PG S{GnA/n-,S), the distance [elognj Spatial 
Dependency of the 5-colourings of PGS(G,S) can become n~^'^^, however, for a number of 
colours which is a constant greater than d^^. 

Section E] is devoted to the proof of Theorem E 

2.2 Our Algorithm 

We start by presenting our sampling algorithm in the form of pseudo code. The input of 
the algorithm is the graph G = iy,E) an instance of Gn^d/n with d > 1, and the integer S. 
Conditioning on the properties of the system stated in the previous section, which hold w.h.p. 
by taking sufficiently large constant S, the algorithm outputs a 5-colouring of the input graph 
distributed within total variation distance n"^''^^ from the uniform distribution over the set of 
all 5-colourings of the input graph. 

We have to mention, here, that our algorithm is based on properties of the spin-system that 
already hold w.h.p.. This means that we can expose the entire instance of the input graph at 
the beginning and expect these, desired, properties to hold, which is highly likely. 

In what follows, we assume that Vi is the i-th vertex to be coloured by the algorithmic and 
Ai is the set of vertices that have already been coloured, before Vi. We, also, assume that Ai is 
coloured as G{Ai). We denote with {Vi, Ei) the vertex set and the edge set, correspondingly, of 
the graph G^^^d,e\ogn, where e = ^^^^^^^212) 

^We remind the reader that the algorithm colours one vertex the time. 
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Sampling Algorithm 

Input: G = {y,E), instance of Gn,d/m number of colours S 

Take an arbitrary permutation of the vertices in V, i.e. (vi, . . . , f„) 

^1 = 

For i = 1, . . . ,n 

-Create the subgraph Gv.^d,e\ogn = (yi,Ei) 

-If Gy^^(i,eiogn = (Vii Ei) is not a tree or a unicyclic graph 

Then Return Failure 
-Colour Vi according to /xj (X^ . | C(j4j n Vi)) 
using dynamic programming 
-Ai+i := Ai U {vi] 
Return Colouring of G 

The probability measure jli{-) is the Gibbs measure specified by the system PCS{Gy.^d,eiogm 'S)- 
Note that if the algorithm, at each iteration of the for loop, had assigned to the vertex 
Vi a colouring, according to fi{Xy.\C{Ai)), instead of ili{Xy.\C{Ai n Vi)), then it would have 
been exact. I.e., the distribution fj,i{Xv-\G (Ai n V)) is an estimation of iJ:{Xy^\C (Ai)) for our 
algorithm. 

There are two issues to be clarified about the algorithm above. The first one is its accuracy, 
i.e. how close is the distribution of the colouring that is returned, to the uniform distribution 
over all proper 5-colourings of the input graph. The second one is its efficiency, i.e. how much 
time is needed for the execution of the algorithm with respect to the size of the input graph. 

As far as the accuracy of the algorithm is regarded, we use Theorem O By the discussion 
at the end of section 11.21 we see that Theorem |A] implies that, w.h.p. the Gibbs measure 
specified by PCS{Gn^d/ni S)-, where 5 is a sufficiently large constant, is such that an asymptotic 
independence (spatial mixing) holds, for the colour assignment of any vertex v and the colour 
assignment of any vertex set at distance [elognj from v, where e = 4 \og(^id/2) • 

We claim that at the i-ih. iteration of the for loop of the algorithm, the following holds: If 
the algorithm has assigned a colouring to the vertices in Ai, according to the Gibbs measure, 
^(•), that specifies PCS{G,S), then it, still, holds the same asymptotic independence as that it 
is implied by Theorem \K\ between the colouring of the vertex Vi and the colourings of vertices 
at distance, at least, [elognj from Vi, where e = 4 iog('e^d/2) • 

Equivalently, we can think of the following situtation. Consider two systems, i.e. Si = 
PCS{G,S) and ^2 = PGS{G,S), with underlying graph G, the input of our algorithm and 
with each system being independent of the other. Assume that, in both systems, we fix the colour 
assignments of the vertices in Ai C V according to //(X^.), with /i(-) specified by PGS{G,S). 
Assume that after having fixed the colour assignments, of the vertices in Ai, we look at the 
colour assignments of the vertices at graph distance, at least, [elognj from the vertex Vi, in 
both systems. Let V' be the vertex set whose colourings have been seen and let C(y') be the 
colouring we see in Si and C'{V') be the colouring we see in 5*2 . The above claim, is equivalent 
to saying that knowing the colour assignments of the vertices in V' in both systems, then the 
total variation distance between the probability measures of the colour assignments of Vi in the 
two systems, correspondingly, is upper bounded by the quanitity SD{vi, [elognj). I.e. 

dTV {E[^i{X,^\Xa,,Xv' = C{V')),E[^i{X,^\XA,,Xv' = C'{V'))) < n-^-''\ (1) 

Where both expectations, are taken over all colourings of A^. where the probability of each 
colouring is according to Gibbs measure /i(-). 



7 



The fohowing theorem, Theorem [Bl (in the proof of which it is shown the vahdity of the 
claim above) gives a characterization of the distribution of the colouring that is returned by 
the algorithm in terms of its total variation distance from the uniform over all the proper 
5-colourings of the input graph, if S is as large as indicated in Theorem El 

Theorem B If S is a sufficiently large integer constant, then, with probability 1 — 0{n~^'^), 
the sampling algorithm is successful and returns a S-colouring of the input graph G, whose 
distribution is within total variation distance n~^'^^ from the uniform over all the proper S- 
colourings of G. 

The proof of Theorem [B] is given in section [H 

As far as the execution time of the algorithm is concerned, we make the following remark. 
According to Lemma El the set {G^^^d^eiogn, for z = 1, . . . ,n}, for e = 4i^gp^, w.h.p., i.e. 
with probability 1 — n~^'^, contains graphs which are either unicyclic or trees. If this is not the 
case, then we consider that the algorithm fails. As argued in we can have a colouring of the 
vertex vi according to fii{Xy^ \G{Ai n V^)) by generating a random colouring of G^^^d^eXogn where 
the vertices in D are colored as G{Ai n Vj) in time upper bounded by / • k^, where / = \Vi\ 
and k = S (for more details see the proof of Theorem O and 0] ) . 

Theorem C The time complexity of the sampling algorithm is w.h.p. asymptotically upper 
bounded by 0{n'^), where n is the number of vertices of the input graph. 

The proof of Theorem O is given in section jH 

We note that at the z-th iteration of the for-loop of the algorithm, we can apply the Junction 
tree algorithm (see [16]) to assign the vertex Vi a colour according to the probability measure 
Jli{Xu\G{Ai n Vi)). The execution time of the junction tree is asymptotically bounded by 
0{n'^~^^), where c < 1 is a sufficiently large constant. 

3 Spatial mixing. 

According to Lemma lAl if G = {V, E) is an instance of Gn^d/n^ then the set of graphs Gy^d,eiogn, 
for V & V, with e = jj^^^^^y^' '^•^-P- contains graphs which, each of them, is either unicyclic or 
trees. Instead of prooving Theorem El equivalently, we show the two following lemmas, which 
are proved in sections 13.21 and 13.31 correspondingly. 

Lemma E Consider the system PC S{Gy^d,6\ogn,<S), ford>l,e= 4iog('Jrf/2) <^'^d for Gy^d,e log n 
we condition that it is a tree. If S is a sufficiently large constant, then with probability at least 
1 — 2n~^'^^, for the above system it holds that SD{v, [elognj) = n~^'^^. For sufficiently large 
d, we should have S > d^'^. 

Lemma H Consider the system PGS{Gy^d,e\ogn,<S), ford> I, e= 4iog°e2^/2) "-^d for Gy^d,e log n 
we condition that it is a unicyclic graph. IfS is a sufficiently large constant, then with probability 
at least 1 — 2n~^'^^, for the above system it holds that SD{v, [elognj) = n~^''^^. For sufficiently 
large d, we should have S > d^^. 

One can see that the lemmas [E] and |H] imply Theorem El see Corollary El 

Corollary A If Lemma and Lemma \I^ are true, then Theorem [A\ is true, as well. 
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Proof: Assume that Lemma lEl and Lemmas iHl are true. Consider the system PCS{Gn^d/m'^)y 
where 5 is a sufficiently large constant and for sufficiently large d, S > d^^. Theorem lAl holds 
for PCS{Gy^iii^^S) if the following event holds with probability at least 1 — 2n~^''^^ 

Eventi = ^^for every graph Gy^d,tiogn of the set of graphs that Gn^d/n specifies, it holds 
that the PCS{Gy^d,eiogn,S) has the property that SD{v, [elognj) = n^^'^^" 

By Lemma lEl and Lemma iHl we have that for sufficiently large S, which for sufficiently large d 
becomes S > d^"^, such that, for every vertex v in G^ d/n the 

Evenly = " the system PCS{Gy^d,eiogn,-S) has the property that SD{v, [elognj) = n~^''^^" 
holds with probability at least 1 — 2n~^'^^. Clearly 

Pr[Eventi] = 1 — Pr 



UyEventy 

By the union bound we get that Pr[Eventi] > 1 — 2n~^''^^, which proves corollary. o 

Note. In both cases the crucial point is to show that a certain quantity (the "disagreement 
probability") has a small expected value. This quantity measures how much we deviate if, for 
colouring some vertex v in the algorithm, we consider only a heighborhood of its, instead of the 
whole graph. In the sequel we carefully establish the necessary upper bounds for this. 
Remark. Both Lemma [E] and Lemma |H] are based on the fact that we expect a very large 
proportion of the vertices of an instance of G^ d/n to have constant degrees. I.e. there is a 
constant cq = co((i) such that for any c > cq the expected proportion of vertices that have 
degree less than c tends to 1, exponentially fast with c. This argument is justified by the 
following corollary that is proved in [6]. 

Corollary B // a random variable Z is distributed as in B{n,q), the binomial distribution with 
parameters n and q, with X = nq then 

Pr[Z >x]<e-'' x> 7A. 



3.1 The process ColourRoot and a coupling. 

Towards proving LemmalEland LemmalHl we introduce, here, the stochastic process ColourRoot 
(T, S,C{L)), where T = {V,E) is a tree, 5 is a positive integer and the vertices in L C V are 
assigned a colouring C{L), such that Q(r, 5, C(L)) ^ 0. The process ColourRoot{T, S,C{L)) 
assigns a colouring (not necessarily proper), to the vertices in V\L such that Vu G V\L its 
colour assignment is distributed as in n{Xu\C{L n T^)), where is the subtree of T rooted at 
u while the Gibbs measure fi{Xy\C{L fl T„)) is specified by the system PCS{Tu,S, C(L n T^)). 
When the third parameter of the ColourRoot is omitted, it is implied that there is no fixed 
colour assignment to any vertex. 

The ColourRoot(T, 5, C(L)) assigns a colouring to each vertex u in the tree T, based on 
the following observation. For the vertex u of T consider the vertex set CHu which contains 
the children of u in and the system PCS{Tu, 5, C(L n T^)). For the graph Tq = Uwi^cHu'^w 
consider the set of 5-colorings Qq = ^1(Tq,S,C{L HTq)) Assume that each C £ specifies a 
colouring of the vertices in CHu that uses all but Wc colours from the set [<S]. Note that if 
the system PCS(Tu,S,C{L n T^)) is in equilibrium, the probability for the vertices in Tq = 

^w&CHyTw to be coloured as specified by C G is proportional to the quantity Wc, i.e. 

Wc 
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Definition G With the above notation, the process ColourRoot(T, S, C{L)) assigns a colour 
to the vertex u of T , as follows: 

1. Each C ^ flQ is assigned weight, Wq, equal to the number of colours in the set [S] that do 
not appear in the colour assignment that C specifies for the vertices in CHu- 

2. Select from JIq such that the probability for each member to be chosen is proportional to 
the weight it has been assigned to it. Let C be the chosen member. 

3. Assign to the vertex u a colour that is chosen uniformly at random among the colours in 
the set [S] that do not appear in the colouring of the vertices in CH^, as this is specified 
by C. 

For a coupling of the processes ColourRoot(T, 5, C(L)) and ColourRoot {T,S,C'{L)), we 
introduce the notion of disagreement probability. 

Definition H Consider a coupling of ColourRoot {T,S,C{L)) and ColourRoot{T , S , C (L)) . 
The disagreement probability for a vertex u in T, denoted by pu, is equal to the probability for 
the coupling to assign different colours to u. 

The couphng of ColourRoot (T, 5, C{L)) and ColourRoot(T, S, C'{L)) is of our main interest 
here, due to the fohowing, very significant fact. 

Theorem D Consider the tree T = {V, E) rooted at the vertex r, some set A QV and any two 
colourings of the vertex set A, such that Q{T,S,C{A)), Q{T , S , C (A)) ^ 0. Assume that there 
is a coupling of the ColourRoot{T,S,C{A)) and the ColourRoot{T,S,C' (A)), for some integer 
S, such that the probability of disagreement for the root r is pr. Then, it holds that 

dTv{KXr\C{A)), fl{Xr\C'(A))) < pr. 

where fi{Xr\C{A)) and fj.{Xr\C' (A)) are specified by the system PCS{T,S). 

Proof: The theorem follows directly from the Coupling Lemma (see [2]). o 

For the system PCS{T, S), where T is a tree rooted at vertex r, one can derive upper bounds 
for SD{r,l), for some positive integer /, by using the above theorem and the coupling of the 
ColourRoot, as described in the following definition. 

Definition I Consider the tree T = {V, E) rooted at vertex r, an integer S and the set Vi CV 
such that dist{{r},Vi) > I for some integer I. Let C{T,S,l) be the coupling of the processes 
ColourRoot(T, S,Ci{Vi)) and Colour Root (T,S). The colour assignment Ci{Vi), is taken so 
as to maximize the disagreement probability at the root of T. The coupling C{T, S, I) assigns 
colours to the vertex uofTas follows: 

• Couple step 2 of the two processes so as to maximize the probability for the vertices in 
CHu to have the same colour assignment. 

• Conditional on the choices the two processes have made at their step 2, assign colours to 
u, so as to minimize the disagreement probability pu- 

In the coupling C{T,S,l), if the height of T is less than /, then the set Vi, as given in 
Definition [J, is empty. It is easy for one to see that in that case, the disagreement probability 
for all vertices in T, is zero. 
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Corollary C Consider a tree T, rooted at vertex r. If the coupling C{T, S, I), for some positive 
integers S,l, has disagreement probability pr for the root of T , then for the system PCS{T,S) 
it holds that SD{r,l) < 2pj.. 

Proof: For the tree T, rooted at vertex r, and the integers S, I, assume that the couphng 
C(T, 5, 1) has disagreement probabihty on the root pr- Consider the vertex set L, which contains 
vertices at distance, at least, I from the root r. Let, also, C{L) and C{L) be the two colourings 
which maximize the total variation distance of the measures fi{Xr\C{C{L)) and iJ,{Xr\C{L)), 
as these are specified by the system PCS{T,S). It holds that 



the second direvation follows by the triangle inequality for measures. The corollary follows, o 

We note to the reader that, here, we will not need to give an explicit desciption of the 
coupling C{T,S,l). It suffices to show that C{T,S,l) has two specific properties, those that 
indicated by Lemma [B] and Lemma 1^ . 

In the rest of this section, we state and prove Lemma [B] and Lemma O These two lemmas 
provide means to derive upper bounds for the probability of disagreement, in the coupling 
C{T,S,l), for each vertex u of T. More specifically. Lemma IB] and Lemma O provide an 
inductive description of the coupling C{T,S,l), in terms of the disagreement probabilities. I.e. 
considering the vertex u and the set CHu of its children, if the coupling C assigns colours to 
each vertex w G CHu such that the probability of disagreement is py^, then for the vertex u the 
probability of disagreement pu, in C, can be bounded as follows 



where a{\CHu\,S) is a constant that depends on the cardinality of CH^ and S. 

We distinguish two classes of vertices in T regarding the relation between their number 
of children and the number of available colours S, i.e. the mixing vertices and the nonmixing 
vertices. The mixing vertices have a number of children which is smaller than S and the constant 
a{\CHu\,S) is very small for them, i.e. << 1. The nonmixing vertices have high degrees and 
for them the constant a{\CHu\,S) may become very large. 

Definition J Each vertex u of the tree T is "mixing" if, for a given t, the number of its children 
in T is less than or equal to t, otherwise it is "nonmixing" . 

The value of t, the maximum number of children of a mixing vertex, in the coupling C{T, S, I) 
is always less than the number of available colours. Generally, for a given tree T and number of 
colours 5, we take t so large as to minimize the probability of disagreement of the root of the 
tree T in C(T, 5, 1). 



^ However, if the reader is keen on finding one, then he can deduce one from the proofs of Lemma [B] and 
Lemma [Cl and the proofs of the claims inside them. 



SD{r,l) = dTv{KXr\C{C{L)),f,iXr\C{L))) 

< drv {fl{Xr\C{C{L)),f,{Xr))+dTV {MXr),fl{Xr\C(L))) 

< 2pr 
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Lemma B Let u be a vertex of the tree T which is mixing. If for every vertex w G CHy, the 
probability of disagreement, in the coupling C{T,S, I), ispw, then for the vertex u the probability 
of disagreement pu, in C, is bounded as 



where t is the maximum number of children of a mixing vertex. 

Proof: Assume that in the coupling C(T, 5, /), when the vertex u is to be coloured, the processes 
ColourRoot(T, 5, Ci(yi)) and ColourRoot(T, 5) at their second step choose from the set of 
colourings Q-c and O^?, correspondingly. 

Let A be the event that, when the vertex u is to be coloured in C the members of ili? and 
Vic, that are chosen at step (2) of the processes ColourRoot, specify different colour assignments 
for the vertices in CHu- Then 

-Pr [disagreement on n] = Pr [disagreement on n|^]Pr[^] + Pr [disagreement on n|^]Pr[^] 

We mention that, if the event A does not hold {A holds), then there is a coupling for step (3) of 
the ColourRoot that assigns the same colour to the vertex u in C, i.e. Pr [disagreement on u\A\ = 
0. Thus, 

Pr [disagreement on n] = Pr [disagreement on ti|^]Pr[^] (2) 

To show the lemma we provide appropriate upper bounds for the probabilities in ([2]), i.e. 
Pr[^] and Pr [disagreement on u\A\. We start with bounding Pr[^], 

We note that the assumption that for each w G CHu, the disagreement probability in 
C, is Pw can be seen as follows: There is a coupling, call it )Ci, which chooses uniformly at 
random (u.a.r.) from the sets and and the two chosen elements specify different colour 
assignments for the vertices in CHu with probability which is upper bounded by Z^iueCH^Pw 

Note that \^f\ \^c\- We create the set Q'p such that, each element of ^Ip appears \^c\ 
times in il.'p. Similarly, we create the set Q'^ such that, each element of ^Ic appears \i^F\ times 
in Q'q. Clearly, = \^'p\- 

Claim A We can have a coupling, call it IC2, that chooses uniformly at random (u.a.r.) an 
element from each of the sets Q'q and 0,'p, such that the probability for the two chosen elements 
to specify different colour assignment for any vertex in CHu is upper bounded by J2weCHuPw- 

Assume that in C, each of the executions of ColourRoot, at step (2), considers the sets Q'q 
and Q'p, correspondingly, instead of ^Ic and Qp. Clearly, the fact that each of the processes 
ColourRoot considers the set ^I'q instead of Vic and ^'f instead of Vlp, correspondingly, does 
not change the marginal probability measure of the colour assignment of the vertex v, in the 
coupling C. 

Claim B Assume that the number of children of the vertex u is k and the disagreement prob- 
ability for w G CHu is Pw If at the coupling of the second step of the processes ColourRoot in 
C{T,S,l), each C G U Q'p is assigned weight Wq, then for the event A we have that 



1 ma ^c&n'pVjn'^iWc] 

V,yjn'^{Wc\Wc > .J ^^^^^ 



^ qk,smincen'un'{Wc\Wc>0} ^ 



where qk^s is the probability of the event that after k trials, not all elements of [S] have been 
chosen, when at each trial we choose u.a.r. a member of [S]. 
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Note that the number of children of a mixing vertex is less than the number of available colours, 
thus, there is no colouring of the vertices in CHu that leave no available colour for u. Since 
we have assumed that u is mixing, it is clear that in our case q^^s = 1- Also note that at the 
coupling of step (2) of the ColourRoot, for colouring the vertex u, no member of either 
or ^}'p is assigned a weight which is more than S and less than S — t, where t is equal to the 
maximum number of children that a mixing vertex can have. Thus, we have 

weCHu 

We proceed to derive a bound for Pr [disagreement on u\A]. For this we use the following claim. 



Claim C Consider the coupling C(T, S,l) when it assigns colourings to the vertex u. Assume 
that the two processes chose members of Q,'p and ^'q that specify colourings for the vertices in 
CHu such that for the vertex u, there are two lists of available colours, li andl2, correspondingly. 
Assuming that \li\ > 0, for i = 1,2, there is a coupling that can choose the same colour for the 
vertex u with probability at least 

^ _ maa;{|/iV2|, 1^2 Vi|} 
min{\li\, \l2\} 

By Claim Id we have that 

■/■ 

Pr [disagreement on n[^] < -j-; 



-t 

since, |/i\/2|7 |^2\^i| and \li\, I/2I > S — t, where li,l2 are as defined in the statement of Claim 
ICl Combining all the above we get the lemma. o 

We now proceed to prove the claims stated in the proof of Lemma |Bl 

Claim A We can have a coupling, call it IC2, that chooses uniformly at random an element 
from each of the sets Q'q and Q'p, such that the probability for the two chosen elements to specify 
different colour assignment for any vertex in CH^ is upper bounded by J2weCHuP^- 

Proof: The coupling /C2 is defined as follows: Choose u.a.r. a member of f^c*) let C be the 
chosen element, by using /Ci, take the corresponding element of ^Ip, let C be that element. 
Then, choose u.a.r. one among the copies of C in 0^ and one of the copies of C in Clearly, 
each of the elements of ^I'q and ^I'p is chosen, uniformly at random. Our claim follows by noting 
that, the chosen elements of Q'q and Q'p differ in the colour assignments of the vertices in CHu 
iff C and C do so. o 



Claim B Assume for the vertex u that \CHu\ = k and the disagreement probability for w € CHu 
is pw If at the coupling of the second step of the processes ColourRoot in C{T,S,l), each 
C G ^'(j U ^'p is assigned a weight Wc, then for the event A we have that 



1 maxc^n'^un'A^c} 



- q,,s m^r^c,n',Z'JWc\Wc > 0} ^^^^/^ 



where qk,s is the probability of the event that after k trials, not all elements of [S] have been 
chosen, when at each trial we choose u.a.r. a member of [S]. 
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Proof: Consider that we choose from Q'p such that the element C is chosen with probabihty 
proportional to its weight, Wc- Consider the same for the set ri^. If there is a coupling of these 
two random weighted selections above, such that the probability of the event A to be upper 
bounded as in ([3]), then we are done. 

The assumption that in the coupling C{T,S,l), for each w £ CHu, the disagreement prob- 
ability is Pu], is equivalent to the following: There is a mapping, call it / : ^I'p — > Q'q, which is 
one to one (and 'onto', since \^'p\ = \^'c\) and for any pair of colourings {C,f{C)) G x ^1'^ 
chosen u.a.r. the probability to specify different colourings for the vertices in CHu is upper 
bounded by Y^waCH^Pw- 

Clearly the mapping / defines a coupling for the "nonweighted" joint random selection of 
the elements the sets Vt'p and fi^, since the two sets are equal sized. Based on / we define a 
coupling for the "weighted" joint random selection of the elements of the sets Q.'p and fi^. 

From 0^ we produce the set fij^, for i € {C, F}, as follows: For each C G Vl'^, insert into 
Oj^, Wc copies of C, i.e. the elements {Ci, . . . , Cwq}- The weighted random selection from Q'^ 
is equivalent to consider that we have chosen C € if a random uniform selection from 
have chosen one of {Ci, . . . , Cwq}- Thus, the construction of a coupling of the weighted joint 
selection from the sets Q'p and Q'q can, equivalently, be reduced to creating a coupling that 
selects uniformly at random one element from each of the sets Q,^ and 17^. This is what are 
we doing in what follows. 

First, we create a mapping /' : {Q^ U UJ2) U uJi), where u!2 C fij^ and uJi C 

and they will be defined soon after. The mapping /' will be created based on the mapping /. 
Then we define the coupling which consists of choosing u.a.r. a member of ft^ U uj2 and then 
applying the chosen element to /' so as to get a member of Q,^ U toi. In this coupling, the 
marginal probability for each member in to be chosen, will be the same for all members. 
This should also hold for the members of ■ The claim will follow by bounding, appropriately, 
the quantity Pr[A]. 

We will define the sets loi, 0J2 as we construct /'. The mapping /' is defined as follows: 
For each C G n'p, with /(C) = Q and Wc = ^/(c) > 0, set f'{Ci) = Qi for i = 1, . . . , Wc- 
For each C € Q.'p, with Wc > and /(C) = Q, set /'(Q) = Qi for i = l,...,Wf(^c) 

and for i = Wji^c) + ^1 ■ ■ ■ set f'{Ci) a u.a.r. chosen member of Let uJi be the 

set of all the elements of Q}q that were randomly selected, as described above. For each 
C G 9.'p, with Wc < Wf^c) and /(C) = Q, we set /'(Ci) = Qi for i = l,...,Wc, and for 
i = Wc + 1, . . . , foi' the copy Qi choose u.a.r. a member of Q'p to correspond to. Let L02 

be the set of all the elements of il'p that were randomly selected, as described above. 

If we choose uniformly at random from Qp^ U UI2 , each element of Qp^ appears equiprobably. 
Similarly, if we choose u.a.r. from Uwi, each element of appears equiprobably. Fur- 
thermore, if we choose u.a.r. from U u!2, and apply /' to get a member from U oji, all 
the members of il^ U loi have the same probability to be chosen, since the mapping /' is one to 
one and onto (|f^^ Uu!2\ = UtJi). Thus, in the coupling where we choose u.a.r. UUJ2 and 
apply the mapping /' and get a member of ^I'q, the marginal probability for all the members 
of (and to be chosen is the same. 

What remains to be shown is that, in the coupling above, the event A occurs with probability 
Pr[^] which is upper bounded as in ([31). 

Clearly, for the colourings in the pairs (C, /(C)) G il'p x that define the same colour 
assignments for the vertices in CHu, we have Wc = ^/(c)- I^or C G Q'p in such a pair of 
colourings, it holds that the copy Cj, that C has in , is corresponded through /' to the copy 
Qi, that Q = f{C) has in , i.e. Qi = f{Ci), for i = 1, . . . , Wc- Note that the event A does 
not hold for these pairs, (Cj, f'{Ci)) for i = 1, . . . , Wc- 

For each pair (C, f{C)) G Vl'p x 0^ that define a different colour assignments for the vertices 
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in CHu, it does not necessarily hold Wc = Wf{c)- Consider, first, the case where Wc = ^/(c)- 
Then, for C € $1^ in such a pair of colourings, it holds that each copy Cj, that C has in Q!p , 
is corresponded through /' to the copy Qi, that Q = f{C) has in $7^, for i = 1, . . . , Wc- The 
event A does hold for these pairs, (Cj, f'{Ci)) for i = 1, . . . , Wc- 

Finally, we consider the case where the pair (C, f{C)) € il'p x Q'^ defines a different colour 
assignments for the vertices in CHu and Wc ^/(C)- W.l.o.g. we assume that Wc > W^/(c)- 
Then, for C G Q,p in such a pair of colourings, it holds that each copy Cj, that C has in , is 
corresponded through /' to a copy of Qj, that Q = f{C) has in for i = 1, . . . , Wj(c'). The 
event A does hold for the pairs (Cj, f'{Ci)), i = 1, . . . , Wf(^c)- The remaining copies Cj, that C 
has in fij^T, through /' are mapped to u.a.r. chosen member of , for i = Wf(^c) + 1, • • • i Wc- 
Note that the event A does not necessarily hold for these pair. However, we assume that it 
does, which, clearly, is an overestimate for the probability Pr[^]. 

Let, cnfu UJ2 be such that 0]f = {C € U wal for (C7, f'{C)) the event A holds} 

and VtA C n'p be VLa = {C Q.'p\ for (C, f{C)) the event A holds}. Clearly, Pr[A] = -^^f^y 

Let l^S^"^ C be such that °^ = {C7 e 9.[\Wc > 0} and = ^£2, for i e {C, F}. One 

can see that < niaxcgf^/^yf^/^lVFc'}, and |0^Uu;2| > l^^^'^^l ■ Tam(-,^Q>o^J^>o{Wc}- 

From the fact that \ = qfI^fI we get that 

• minc.gf^>Ouf^>o{Wc} I^^fI ' 

Clearly, qp = qk,Si where qk^s is as defined in the statement of the claim. The claim follows by 
noting that {g^ < Y.w&CH^Pw- o 



Claim C Consider the coupling C(T, S, I) when it assigns colourings to the vertex u. Assume 
that the two processes chose members of Qp and i^c that specify colourings for the vertices 
in CHu such that for the vertex u, there are two lists of available colours, the li and the I2, 
correspondingly. Assuming that > 0, for i = 1,2, there is a coupling that can choose the 
same colour for the vertex u with probability at least 

^ _ max{|/iV2|, 1^2 Vi|} 
mm{|/i|, 1/2!} 



Proof: The coupling that can choose the same colour for the vertex u with probability indicated 
in the statement of the claim is the maximal coupling (see |10)). 

More speciically, we assume, w.l.o.g., that \li\ > \I2\- Let C/ be a random variable uniformly 
distributed in (0, 1). We assume that if ^ <U < -^we choose the color i € /i, for = 1, . . . , 

Also, Vi € /i n I2 assume that if ^ ^ ^ |I7[ choose i in I2- For U everywhere else in 
(0, 1) make an arbitrary arrangement so as each element of I2 to be chosen with probability 
1//2- By the assumption that |/i| > I/2I, to all members i G fl ^2 we have assigned intervals 
which correspond to probability l/|/i| < 1/1^21- 

Clearly, the interval in (0, 1) that corresponds to choosing different colourings from li and 
I2 is of length -^^j^^ • The claim follows by the fact that 

I/1V2I ^ max{|/iV2|, |^2\^i|} 
~ mm{|/i|, 1^2!} 
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Lemma C Let u he a vertex of the tree T which is nonmixing and has k children. If for 
every w E CH^ the probability of disagreement, in C{T, S, I), is pw, then for the vertex u, the 
probability of disagreement pu, in C, is bounded as 

Pu<S—l pA (4) 



Qk,s 



and Qk^s is the probability of the event that after k trials, not all elements of the set [S] have 
been chosen, when at each trial we choose uniformly at random a member of [S] . 

Proof: Assume that in the couphng C{T, S, I), when the vertex u is to be coloured, the processes 
ColourRoot(T, 5, Ci(Vi)) and ColourRoot(T, 5) at their second step choose from the set of 
colourings i^c and ^Ip, correspondingly. 

Let A be the event that, when the vertex u is to be coloured in C the members ofQp and l^c 
that are chosen at step (2) of the two processes Colour Root, correspondingly, specify different 
colour assignments for the vertices in CHu- Then 

Pr [disagreement on n] = Pr [disagreement on u|^]Pr[^] + Pr [disagreement on u|^]Pr[^] 

We mention that, if the event A does not hold {A holds), then there is a coupling for step (3) of 
the ColourRoot that assigns the same colour to the vertex u in C, i.e. Pr [disagreement on u\A\ = 
0. Thus, 

Pr [disagreement on u] = Pr [disagreement on ti|^]Pr[^] (5) 

To show the lemma we derive appropriate upper bounds for the probabilities in ([5]), Pr[^] 
and Pr [disagreement on u\A]. We work exactly in the same manner as in the proof of Lemma 
|B]so as to get an upper bound for the term Pr[^], i.e. as in Lemma [B] we have 

J - qk,s mincen',un'jWc\Wc > 0} 

Note that at the coupling, of the step (2) of the ColourRoot, for colouring u, no member of either 
Q'fj or Q'p is assigned weight more than S and the minimum non zero weight is 1. Furthermore, 
for a nonmixing vertex u, of sufficiently high degree, there are colourings of its children that use 
every colour in [S], these colourings are assigned weight zero, in in this case we have ^^,5 ^ 1- 
The lemma follows by assuming that Pr [disagreement on = 1 which is, clearly, an 
overestimate. o 



3.2 The case of a tree - How to prove Lemma [El 

Consider an instance of Gn^d/n-, where d> 1, and for each vertex v consider the graph Gv^d,eiogn, 
where e = j^;^^Tjj2)- -^Y Lemma \K\ it holds that w.h.p. Gy^d,eiogn is either a unicyclic graph or 
a tree. Here, WG condition that th.6 graph. Gv^d^eiogn is a tree. 

Definition K The graph Gy^d,eiogn when we condition that it is a tree, defines a probability 
space over the trees which we call T^- 
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Note that each nonleaf vertex of an instance of has a number of children whose distribution 
is dominated by B{n,d/n), i.e. the binomial distribution with parameters n and d/n. 

Clearly, Lemma [E] will follow by showing that if iS and e are as large as specified by this 
lemma, then with probability, at least, 1 — 2n~^'^^, for the tree T, an instance of rooted at 
the vertex r, the disagreement probability pr in the coupling C{T,S, [elognj) is bounded as 
Pr < n"^-^^/2 (see Corollary [C]) . 

It is easy to see that if, T, an instance of T^, is of height less than some integer /, then the 
disagreement probability on the root of T in C{T, S, I), is zero. 

In C{T,S,l), where T is an instance of rooted at r, the disagreement probability pr 
depends only on the structure of the instance of T^, for a given S. We remind the reader 
that in C, we assume that the boundary conditions are set so as to maximize the disagreement 
probability pr- Clearly, pj. is a random variable. We use the Lemma iB] and Lemma O to derive 
an upper bound for the expectation of pr- The expectation of pr depends on /, S and t, the 
maximum number of children of a mixing vertex. Let q(t) be the probability for a random 
variable, distributed as in B{n — l,d/n), for fixed d, to be less than t. 

Lemma D For positive integers S, I, real d > 1, in the coupling C{T,S,l), where T is an 
instance of Td, the expectation of the disagreement probability pr, on the root of T , is bounded 
as 

^^P^^ ^ {^J^^2^it) + ('^(l - m) + ^ (exp { (5^} - (6) 

Proof: We remind the reader that t stands for the maximum number of children of a mixing 
vertex. Let q{t) be the probability for a random variable, distributed as in B{n — l,d/n), for 
fixed d, to be less than t. Let 



i{i<t 



a(i) 



t-S 
S 

otherwise 



where qi^s^ is as defined in the statement of Lemma O 

Consider the coupling C{T,S,l), where T is an instance of rooted at the vertex r. Let 
E\pr] be the expectation of the disagreement probability on the root r. Conditioning on the 
number of children of r and the disagreement probability p^, G CHj. in C{T, S, I), by Lemma 
IB] and Lemma O we have 

E\pr\Pw, € CHr] = a{\CHr\) ^ Pw 

w£CHr 

By definition, \/w & CHr, Pw upper bounded by the disagreement probability on the vertex 
w in the coupling C{Tw,S,l — 1) where is the subtree of T rooted at vertex w. Call this 
disagreement probability p"^. We clear out that pw refers to the coupling C{T,S,l) while pj^ to 
C{Tw,S, I — 1). It is direct that 

n 

E[pr] < ^m(i)Pr[|Ci^^| = i]E\pl] for w G CHr 
i=0 

Note that the random variables for w € CHr are identically distributed and independent 
of the number of children of r. Also, noting that the function /(i) = i ■ a{i) is increasing for 
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t « S and by the fact that the distribution of the number of children of r is dominated by the 
B{n,d/n), (by proposition 9.1.2. of [E]), it holds that 



E\pr]<J2z.a{i)ryil-pr-'E\p*J iorweCHr (7) 



1=0 



where p = d/n. Let Si = E*=o^ ' a(*)(")p'(l " and S2 = E7=t+ii- aii)Q)p'i^ - pT~' ■ 

Before calculating 5*2, we eliminate the probability term qt^s from the a{i) for i > t. For qt^s it 
holds that 



qi,s > 5 ^1 - (1 - gi,(5-i) 



i.e. the probability of the event "not choosing some element of [5] after i trials" is greater than, 
or equal to the probability of the event "not choosing exactly one element of [5]", since the 
second event is a special case of the first one. Furthermore, since qk,{s-i) ^ Qk,s we get that 

qi,S > S - {I - qi,s) 

Let Q, = {1, . . . ,n} and let to = sup{t G Cl\ qt^s ^ 1/2}- Instead of using qi^s we make the 
following simplification. For i > to we bound the quantity l/qi,s as 

1 ^ 2 2/5 



Also, for i < to, clearly, l/qi^s < 2. 



n — l\ i,^ „ o ■t-^ / n - 1 



< 25d(i-E*=U"7V(i-pr-'-0 + 



n— 1— j 



\n—\—% 



+2^,d (^(1 - p + - Elzl ryi) (3^) -P)^ 

< 25d(l-g(t)) + 2-^dffl + ^^'V '-Ef''7^V^(l-^)""'" 



5-1 vV 5-ry ^\ « 

< 2Sd{l-q{t)) + 2^^d(^l^l + ^^py ' -q{t) 

< 2d (^5(1 - g(t)) + ^ (exp{d/(5 - 1)} - g(t))) 
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Substituting the bounds for and ^2 in ([7]) we get 

E[Pr] < {dj^^q{t) + 2d (^5(1 - q{t)) + ^ (exp{d/(5 - 1)} - E[pl] 

for w £ CHr- We can substitute in the same manner as £^[pr]- Using induction and 

assuming that for the vertices at distance / from the root the expectation of the probabihty of 
disagreement is 1, the lemma foHows. o 

Finally, Lemma |E] follows by setting appropriate quantities for S and /, in ([B]) and then 
applying the Markov inequality. Here it is crucial to remark that if d is sufficiently large, then 
for t>7dit holds q{t) > 1 - d'^s. 

Lemma E Consider a system PCS{Gjj^d,eiogn,S), for d> I, e= ^j^^^J^^^) '^'^d for Gv,d,e\ogn 
we assume that it is a tree. If the cardinality of S is a sufficiently large constant, then with 
probability at least 1 — 2n~^''^^ , for the above system it holds that SD{v, [elognj) = For 
sufficiently large d, we should have S > d^^. 

Proof: By Lemma iDl we have that, in the coupling C {Gy^d,e log m S , C (L)), the expectation of 
Py, is bounded as 

where I is the minimum distance of v and the boundary set L, q{t) is equal to the probability for 
a random variable, distributed as in B{n — 1, d/n), for fixed d, to be less than t, the maximum 
number of children of a mixing vertex. 

Set / = elogn, with e = To prove the lemma it suffices to show that for S 

as described in the statement of the lemma and appropriately large t we have E[py] < 
Clearly, for E\py] < n~^'^ by using the Markov Inequality (see |3j) we can get that 



By Definition |F] and Theorem iDl we get that if E[py] < then with probability at least 1 — 

Pr\py > 2n-i-25] > l-2n-i-25 for the system PC5'(G^,,rf,eiogn, 5) it holds that ^^(t;, [elognj) < 
n~^'^^, which proves the lemma. 

Thus, what remains to be shown is that there are appropriate values for t and S such that, 
E\pv] < 

First, we show that if d is a sufficiently large constant, then for S > d^^ and t such that 
lii) > 1 — we get E[py] < Using Corollary |B] (from [B]) we see that when t > 

max{7d, 28 log d+ 1} it holds q{t) > 1 - d^^®. 

Corollary B If a random variable Z is distributed as in B{n,q) with X = nq then 

Pr[Z >x]<e-'' x> 7X. (9) 

For a proof of the above corollary, see in [6| the proof of Corollary 2.4. 

Assuming that d is a sufficiently large constant, we substitute the values of S with d^'^ and 
t = 7d, in ([8]) and we get 

^, , / 7<i"= .,r,14. 28 <i" f d S .'liWX'""' 
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where < ^ < d/{d^'^ — 1). In the above inequahty we used the fact that 1 — d~'^^ < q{t) < 1, 
and substituted e^^^'^^^^ by using the MacLaurin series of the function f{x) = e^, for real x. 
Thus, we get 



< U ^^-T^ + 2d~^ + + + 



€ log n 



^ ^(l-7d-13)2 (l_d-14)2 (l_d-14)3 l-d-^yj 

Taking d at least 20, we get that 

Replacing e, we see that it suffices to hold 0.91og(9.2(i-i2) < _io log(e2d/2), or 9.2d-^^ < 
{e'^d/2)~^^'^^ which clearly holds for sufficiently large constant d. 

For relatively smaller d, one can easily see that setting S = d^, for an appropriately large 
constant exponent x and arranging the quantity t so as q(t) > 1 — d~'^^ similarly to the previous 
case, can get 

^, , f, .+2 / t/d , 1 2d-'-^ 2 <i'«i \ V'°'" 

^ [" [irrTHf^ + + rrTR + (irrTF^ + 'JT^d^^ } ) 

We take x sufficiently large so as to have 1 — d~^ > 1 — 10^^ and xd~^ < 10^^. 

1ft = 7d, then, with the above assumptions, we can easily derive that E\py] < (^"^^^16)^'°^"". 
For this case, if E\py] < n"^'^, then we should have lGd~^^'^ < (e'^(i/2)^^'^^, which clearly holds 
for sufficiently large x. 

If 2xlogd + 1 > 7d, then by Corollary [B] we should have t = 2xlog(i + 1. With the 
assumptions we have made for X we get that < (d~^"'"^(2.1a;^^+9))'^^°s". If£'[pi;] < n^^-^, 

then it should hold ((i-^'+2(2.1x^ + 9)) < (e2d/2)"-", which clearly holds for sufficiently large 

X. 

The lemma follows 



3.3 The case of a unicyclic graph - How to prove Lemma [HI 

Consider an instance of Gn^d/m the set of its subgraphs Gi,^d,eiogn, as in 13.21 By Lemma 
\K\ we have that w.h.p. G^^d,eiogn is either a unicyclic graph or a tree. Here, we condition that 
that Gv^d,eiogn is a unicyclic graph. 

First, we show how can we extend the techniques for proving Lemma lEl i.e. proving spatial 
mixing properties of system with underlying graph a tree, to showing Lemma IHl which refers 
to systems with a unicyclic underlying graph. 

Consider the depth ffi'st search in Gy^d,eiogn, that starts from the vertex v and let u be the 
first vertex of the unique cycle that is reached by the search. Clearly, there are two possible 
choices for this search to explore the vertices of the cycle that u belongs to. If wi and W2 are 
the vertices on the cycle that are also adjacent to u, then let and be the two depth- first 
search trees of Gv^d,eiogn, rooted at v, with the first tree having u adjacent only to wi (not 
adjacent to 102) and the second having u adjacent only to W2- 
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Definition L With the above notation, the tree T^^d,t\ogn is isomorphic to the tree that comes 
up from the union of and T^^ plus an edge connecting the vertices u in with W2 in T^^ . 
The root r of T^^d^e log n corresponds to the vertex v inT^ . 

Note that the number of children of a nonleaf vertex of Tr^d,tiogn has distribution which is 
dominated by the B{n,d/n) with the condition that it is at least 2. 

Each of the trees and T^^, in the definition oiTr^d,eiogn, are isomorphic to some subgraph 
of G^d eiogn; there is a correspondence between the vertices in and with the vertices 
in Gv^d,eiogn- Based on this correspondence, we can define a function h : Vr ^ Vq where, Vr is 
the set of vertices of Tr^d,eiogn and Vq the set of vertices in G^^^^eiogn- 

Let L be the set that contains all the vertices in Gy^d,eiogn that are at graph distance, at 
least, [elognj from v. Consider the iS-colouring Ci(L) which is such that, the set the total 
variation distance of the Gibbs measures n{Xy\Ci{L)) and fi{Xv), as these are specified by the 
system PCS{Gv^d,eiogn,<S), is maximized. 

For the tree Tr^d,€iogn derived by Gv^d,eiogn, the integers S and I, let C'{Tj.^d,eiogn,S, I), be the 
coupling of the stochastic processes ColourRoot(Tr^rf^eiogn, S, C'[{A)) and ColourRoot(rj.^d^eiogn) S). 
The set A is such that \/v € A 3u e {Vi U L) such that h{v) = u and Gj{v) = Ci{u). The 
difference of C from C regarding the coupling of the processes Colour Root is that, if for a non 
mixing vertex u in T, which has i children, each vertex w G CHu has disagreement probability 
Pyj, then the disagreement probability pu, is bounded as 

where qi^s,2 is the probability of the event that after k trials, not all elements of the set [S] have 
been chosen, when at each trial we choose u.a.r. a member of [S] and conditioning that the first 
two trials chose different elements of [S]. 

Comparing the bound in (llOp with that was given in ([3]) in the statement of Lemma ICj it is 
easy to see that qi^s,2 < Qi,s- This implies that the coupling C can exist as, for all the vertices 
of T, it gives the same, or worse bounds for the probabilities of disagreement than C, on the 
same input. 



Lemma F Consider the graph Gv,d,eiogn o-nd the corresponding tree Tr^d,eiogn, for d > 1 and 
e = 4 \og(e^d/2) • ^he hound for the disagreement probability that we derive from Lemma 

W\ and [W\) for the coupling C'{Tr^d,eiogm<S, [elognj), then it holds that SD{v, [elognj) < 2pr 
in the system PC S{Gy^d,e\ogn, > for any positive integer S. 



Proof: Let u be the vertex in Gv^d,tiogn which belongs to the unique cycle of Gy^d,tiogn and 
among all the vertices in the cycle it has the smallest distance from v. Let be the connected 
subgraph of Gy^d,t\ogn that contains the vertex u and the vertices whose distance from v is 
greater than that of the vertex u from v. It is easy to see that that G" is a unicyclic graph and 
Gy^d,e log n\G'^ is a tree. 

Let L be the set that contains all the vertices in Gy^d,eiogn that are at graph distance, at 
least, [elognj from v. Consider the <S-colouring Ci{L) which is such that the total variation 
distance of the Gibbs measures fj,{Xy\Ci{L)) and fi{Xy), as these are specified by the system 
PCS{Gy^d,eiogn,'S), to be maximized. 

Assume that there is a coupling such that choosing uniformly at random a member from 
each of the sets of 5-colourings U{G'^ , S , Ci{L)) and Q{G^,S), correspondingly, the probability 
for the two members to specify a different colour assignment for the vertex u is Q. Clearly, 
using Lemma IB] and (jlOp we are able to derive an upper bound for the disagreement probability 
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Py, of the coupling of the processes ColourRoot(r, 5, Ci(T n L)) and ColourRoot(T, 5), where 
T = Gy^d,tiogn\iG^\{u}) and with the assumption that the disagreement probabihty of the 
vertex u is set apriori to Q. Note that the graph Gy^(i,e log n\{G^\{u}) is a tree, i.e. it includes 
all the vertices in Gy^d,eiogn apart from the vertice in G^\{u}. This bound of p„ is also an upper 
bound for the total variation distance of the Gibbs measures, fi{Xy\Ci{L)) and iJ,{Xy), as these 
are specified by the system PCS{Gy^d,e\ogn,S). 

The lemma will follow by showing that for an appropriately constructed tree T", with respect 
to G", and appropriate boundary condition C(, if in coupling of the processes ColourRoot(T", 
5, C() and ColourRoot(T", S) has disagreement probability P at the vertex u, the root of T", 
then it holds Q < P, where the probability Q is as previously defined. W.l.o.g. assume that 
the vertex v belongs to the unique cycle of Gy^d,€iogni i-s. the and Gy a eiogn Sive identical. 

Each of the trees and T^^, in the definition of Tr^d,eiogn (Definition [L]) , are isomorphic 
to some subgraph of Gy^d,eiogn, i-e. there is a correspondence between the vertices in and 
with the vertices in Gy^d,eiogn- Based on this correspondence, we can have a function 
/i : Vt — > Vg where, Vr is the set of vertices of Tr^d,tiogn and Vq the set of vertices in Gy^d,eiogn- 

Let L' be the set of vertices in Tr^d,eiogn such that L' = {« € VT\h{u) G L}. It is direct that 
the vertex set L' is at distance, at least, [elogrej from r, in Tr^d,eiogn- is the vertex set that 

contains all the adjacent vertices of v in Gy^d,eiogn, then for wq £ Ny, let G""' be the connected 
component of Gy^d,t\ogn\{v} that contains u'o- It is straightforward that the component G"*" is 
a tree which is isomorphic to the subtree of Tr^d,eiogn, where h{w) = wq. 

Let Gy(L') be a colouring such that, for each vertex w £ L' it specifies the same colour 
assignment as Gi{L) specifies for wq G L, where h{w) = wq, for i = 1,2. The fact, that, for 
any wq E A''^ and w € CHr such that h{w) = wq, the graphs G'^" and Ty; are isomorphic 
implies that there is a correspondence between the elements of the sets ^1{G^° ,S,Gi{L)) and 
^l,{Tw,S, Cj^{L')) such that for any two corresponding colourings G^ and G^ it holds that Vui G 
T.yj G^(/i(mi)) = G^(ni). Clearly, there is a similar correspondence between the members of the 
sets of 5-colourings Vl{G'^° ,S) and 0(r^,(S), for wq G A^'^,, w G GHj. and h{w) = wq. 

Using the above correspondences between the pairs of sets Q,{G^'\S, Gi(L)), Q{Tw,S, G)p{L')) 
and J7(G'"'',5), Q{Ty;,S,Gj'{L')), with the assumption that \/w G GHr, in the coupling C 
(Tr,(i,e log n ,5, [e log nj ) the disagreement probability for the vertex w is upper bounded by p^, 
we get the following: There is a coupling such that choosing u.a.r. from the sets of 5-colourings 
f](G"''', 5, Gi(L)) and r2(G'"°,5) such that the probability for the two chosen elements to assign 
different colour to the vertex wq is upper bounded by pyj, where h{w) = wq. Furthermore, there 
is a coupling that chooses u.a.r. from the sets ^{Gy^d,e log n\{i'},'S, Gi(L)) and ^{Gy^d,eiogn\{v}) 
and there is at least one vertex A^^ that the two choices specify a different colour assignment 
with probability upper bounded as J2w&CHrPw- The bound for this probability is derived by 
using the union bound. 

Remark. Note that if wi and W2 are the vertices in Ny that also belong to the cycle, then the 
events ^^there is disagreement on the vertex Wi \ for i = 1,2, are correlated. However, the 
union bound we used for bounding the probability for at least one vertex in A'^ to be differently 
coloured, in the coupling, still holds even for correlated events. 

Claim D //, the coupling C'{Tr^d,e\ogmS, [elognj), has probability disagreement on the vertex 
r upper bounded by pr, then we can have a coupling that chooses u.a.r. a member from each of 
the two sets ^l{Gy^d,eiogn,'S,Gi{L)) and ri{Gy^d,e\ogn-,S) such that, the probability for v to have 
different colour assignments to be upper bounded by pr ■ 

With the above claim and what follows, we get the proof of the lemma. Let G{L) and C{L) 
be the two colourings which maximize the total variation distance of the measures ii{Xy\G{C (L)) 
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and n{Xy\C{L)), as these are specified by the system PCS{Gy^d,eiognyS)- 
SD{v,elogn) = drv {KXv\C{C{L)), fi{X,\C{L))) 

< drv {KX.,\C{C{L)),fi{X,)) +dTv {fi{X,),fi{X,\C{L))) 

where pr is the bound of the disagreement probabihty on the vertex r , at the couphng C {Tr^d,e log n 
5, [elognj). o 
We now prove the claim that appears in the proof of Lemma iFl 

Claim D If, the coupling C'{Tr^d,e\ogm<S, [elognj), has probability disagreement on the vertex 
r upper bounded by Pr, then we can have a coupling that chooses u.a.r. a member from each 
of the two sets Q,{Gy^(i,e\ogn,<S,Ci{L)) ^{Gy^d,e\ogn,<S) such that, the probability for v to have 
different colour assignments to be upper bounded by pr ■ 

Proof: Assume that in the coupling C'{Tr^d,e\ogn-,S, [elognj) each vertex of Tr^d,e\ogn-, whose 
number of children is at most t', for some positive integer t', is considered mixing. The vertices 
r and v have the same degree in Gv^d,e\ogn and Tr^d,e\ogn-, correspondingly. 

Assuming that \/w G CHr, in the coupling C'{Tj.^d,e\ogn-,S., [e log nj ) the disagreement proba- 
bility for the vertex w is upper bounded by p^, we can see that there is a coupling that chooses 
u.a.r. a member from each of the sets VL{Gv^d,e\ogn\{v}-,S,Ci{L)) and ^{Gv^d,e\ogn\{v}-,S) and 
there is at least one vertex Ny that the two choices specify different colour assignments with 
probability upper bounded as ^w^cHrPw- Thus, the claim follows if, based on the above as- 
sumption, we give a coupling that selects u.a.r. a member from Q.{Gv^d,e\ogniS.,Gi{L)) and 
^{Gy^d,€\ogn-,S) such that, the probability for v to have different colour assignments to be upper 
bounded by the probability of disagreement pr in the coupling C'{Tr^d,e\ogn-,S^e\ogn). 

If the degree of the vertex v is at most t', then we use Lemma |B] to get an upper bound 
for the probability for a coupling that chooses u.a.r. from the sets ri{Gy^d,e\ogn-,S,Ci{L)) 
^{Gv,d,e\ogn-,S) to choose two members that specify different colour assignments for the ver- 
tex V. Clearly, we get the same bound for the probability of disagreement pr in the coupling 
C'{Tr4^e\ogn,S, [elognJ). 

We note that despite the fact that the colour assignments of two adjacent vertices of v are 
correlated, we can still apply Lemma [Bl This is because in Lemma iB] it is assumed that, if 
there is a disagreement in the vertices in CHy (to be exact with the context of the proof we 
have to write N^)^ then all the vertices have different colour assignments. This leads clearly to 
an overestimate for bounding the disagreement probability for p^ even for the case where the 
two colourings are correlated. 

If the degree of the vertex v \s i, greater than t' , then we use Lemma [Cl with a little 
modification, to get an upper bound for the probability for a coupling that chooses u.a.r. from 
the sets ^{Gv^d,t\ogn-,S, Ci{L)) and ^{Gy^d,e\ogn-, S) to choose two members that specify different 
colour assignments for the vertex v. One can see that the term in © of the statement 

of Lemma O is not exact for our case. More specifically, in the last paragraph of the proof of 
Claim IHl for our case the quantity qp is not equal to Qi^s due to the fact that two vertices do not 
choose independently their colour assignment. However, it is direct that qp is lower bounded 
by the probability of the event that after i trials, not all the elements of [S] have been chosen, 
when at each trial we choose u.a.r. a member of [S] and conditioning that the first two trials 
choose different elements of [S] . With this modification we can see that the coupling of choosing 
u.a.r. from ^{Gy^d,eiogm<S,Ci{L)) and ^{Gv^d,eiogn,'S) can be done such that the vertex v to 
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have different colour assignments with probabihty 

Pv < P'^- 

where ^4,5,2 is the probabihty of the event that after k trials, not all elements of [S] have been 
chosen, when at each trial we choose u.a.r. a member of [S] and conditioning that the first two 
trials chose different elements of [S]. 

The claim follows. o 

Towards proving Lemma [Hi we use Lemma IF] which allows us to consider the tree T^^d^eiogn 
derived by unicyclic graph G^^d^eiozn-, instead of Gv^d,t\ogn- In turn, we can use the same 
techniques as in section 13.21 for showing the desired spatial mixing properties for systems with 
underlying graph the tree Tr^d,t\ogn- Note that now the coupling is C. Let q{t) be equal to the 
probability for a random variable, distributed as in B{n — 1, d/n), for fixed d, to be less than t. 

Lemma G For positive integers S, I, real d > 1, in the coupling C'{T,S,l), where T is an 
instance of Tr^d,e log n, the expectation of the disagreement probability pr, on the root of T , is 
bounded as 

E[Pr] < (r3(^^ + 2d ('^(i - '^(^)) + { 5^} - '^(^^ 

Proof: We remind the reader that t stands for the maximum number of a children of a mixing 
vertex. Let q{t) be the probability for a random variable, distributed as in B(n — l,d/n), for 
fixed d, to be less than t. 

One can see than in the Tr^d,eiogm the number of children of a nonleaf vertex has distribution 
which is dominated by the B{n,d/n) with the condition that there are at least two children. 
Let Z be a random variable distributed as in B{n,d/n), clearly 

Pr[Z >2] = l-(l--Y-n-(l--Y \l-(d + l)e-'^ 
V nj n \ nj 



Let 

a(i) 



t-S 

ifi<t 



(s - ty 

S 



otherwise 



(li,S,2 

where qi^s,2-, is as defined in ([TO]) . Consider the coupling C'{T,S,l), where T is an instance of 
7r,d,eiogn rootcd at the vertex r. Let E[pr] be the expectation of the disagreement probability 
on the root r. Conditioning on the number of children of r and the disagreement probability 
Pw, G CHr in C'{T,S,l), by Lemma IB] and Lemma Owe have 

E\pr\pw, G CHr] = a{\CHr\) Pw 

weCHr 

By definition, \/w E CHr, Pw is upper bounded by the disagreement probability on the vertex 
w in the coupling C'{Tyj,S,l — 1) where Ty^ is the subtree of T rooted at vertex w. Call this 
disagreement probability p^. We clear out that p^ refers to the coupling C'{T, S, I) while to 
C'(Tw,S,l — 1). It is direct that 

n 

E\pr] < ^ia(i)Pr[|Cif^| = i]E[p*J for w € CHr (11) 

i=0 
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Also, noting that the function f{i) = i ■ a{i) is increasing for t « S and by the fact that the 
distribution of the number of children of r is dominated by the B{n, d/n), (by proposition 9.1.2. 
of [13]), it holds that 

^[^^] ^ E [fjP'i^ - Pr~'E[p*J for w G CHr 

where p = d/n. 

Let Si = ELo^ • a{i)Qp'{l-pr-' and ^2 = E7=t+ii ' a{i)Qp'{l - pr~\ Using the 
derivation of Lemma [D] we have that 

Si = 

Before calculating 5*2, we eliminate the probability term qi^s,2 from the a{i) for i > t. Note that 
Qi,S,2 > 91-1,5-1, i-e. is the probability for not choosing all the elements of a set of 

cardinality 5 — 1 after i — 1 trials when at each trial we choose u.a.r. a member of the set. For 
Qi,s,2 it holds that 

qi,s,2 > qi-i,s-i >{S-l)(^l- -^^^ (l - 9i-i,(<s-2)) 

i.e. the probability of the event "not choosing some of the 5 — 1 elements after i — 1 trials" is 
greater than, or equal to the probability of the event "not choosing exactly one of 5 — 1 elements 
after i — 1 trials" , since the second event is a special case of the first one. Furthermore, since 

9j-i,(5-2) < qi-i,s-i we get that 

q^-l,s-l > (5 - 1) (^1 - ' (1 - g.-i,5-i) (12) 

Let ri = {1, . . . ,n} and let to = sup{t S il| qt-i,s-i ^ 1/2}- Instead of using we make 

the following simplification. For i > tQ we bound the quantity 1/qi-i^s-i as 

< 



Also, for i < to, clearly, 1/qi^s ^ 2. With derivations similar to those in the proof of lemma iDl 
for 52 we get that 

52 < 2d (^5(1 - q{t)) + ^ (exp{d/(5 - 2)} - q{t)) 
Substituting the bounds for Si and 52 in ([TT]) we get 

^^r] < {djj^qit) + 2d (^5(1 - q{t)) + ^ (exp{d/(5 - 2)} - g(t)))) E[p: 

for w G CH,.- We can substitute -^IpJ^] in the same manner as E[pr]. Using induction and 
assuming that for the vertices at distance / from the root the expectation of the probability of 
disagreement is 1, the lemma follows. o 

Lemma |Hl follows by combining the lemmas |F] and O 
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Lemma H Consider a system PCS{G^^d,eiogn,S), for d > 1, e = ^^^^^J^^^^ and for G^^d,eiogn 
we assume that it is a unicyclic graph. If the cardinality of S is a sufficiently large constant, then 
with probability at least 1 — 2n~^'^^, for the above system it holds that SD{v, [elognj) = n~^''^^ . 
For sufficiently large d, we should have S > d^'^. 

Proof: To prove the lemma we first show that using the couphng C {Tr^d,eiogm [^^ogn\) for 
the system PCS{Tr^d,e\ogn,'S), it holds that SD{r, [elognj) = n~^'^^ with probability at least 
1 — 2n~^'^^, when 5 is a sufficiently large constant and for sufficiently large d, we should have 
S > d^^. Then, the lemma will follow by using Lemma iFl 

By Lemma Owe have that, for the coupling C over the trees Tr^d,eiogn when S colours are 
used and the boundary set L is at distance, at least, I from the root r, the expectation of pr is 
bounded as 

< [y^-J^^P[)^ + 2^ ('^(l - + (exp{(i/(5 - 2)} - q{t)) 

(13) 

where I is the distance of v and the boundary L, which in our case is I = [elognj, with 
e = i^g^g^d/2) • ^IsOi ^(0 is equal to the probability for a random variable, distributed as in 
B{n — l,d/n), for fixed d, to be less than t, the maximum number of children of a mixing 
vertex. 

Set I = elogn, with e = i^gl^^-Fd/2) &■ To prove the lemma it suffices to show that for S 
as described in the statement of the lemma and appropriately large t we have E[pr] < n~^'^. 
Clearly, for E[pr] < n~^'^ by using the Markov Inequality (see [3]) we can get that 



Pr\p. > 2n--^] < ^ = n--V2 



By Definition [F] and Theorem iDl we get that if E[pr] < n then with probability 1 — Pr[pr > 
2^-1.25] > 1 _ 2n-i-25 the system PCS{Tr^d,eiogn,S) it holds SD{r, [elognj) < n'^-^s, which 
proves the lemma. 

First we show that for sufficiently large d, for S > d^^ and t such that q{t) > 1 — d"'^^ we 
get E\pr] < n^^-^. By Corollary [B] (in the proof of lemma lEj) we have that if a random variable 
Z is distributed as in B{n, q) with X = nq then 

Pr[Z > xj < e""^ x> 7A. (14) 

Thus, for t = max{7d, 28 log d + 1} it holds q{t) > 1 - d'"^^. 

Assuming that d is a sufficiently large constant, we substitute S and t in (fT3|l and we get 




f 1 ( 7d^^ 

2d d e^ d' . . ' ' ' 

where < ^ < d/{d^^ — 1). In the above inequality we used the fact that 1 — d"^^ < q{t) < 1, 
and we substituted e'^/^'^^^^ by using the MacLaurin series of the function f{x) 
X. Thus, we get 

/ / 7 2d-^ 2 +2d-i5 



l-((f+l)e-M (l-7d-13)2 l-d-14 (l_d-14)2 (1 _ t^-14)3 1 _ (^-14 
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Taking d at least 20, we get that 



Replacing e, we see that it suffices to hold 0.91og(9.2d-^2) < -10 Iog(e2d/2), or 9.2d-^'^ < 
{e'^d/2)~^^'^^ which clearly holds for sufficiently large constant d. 

For relatively smaller d, one can easily see that setting S = d^, for an appropriately large 
constant exponent x and arranging the quantity t so as q{t) > 1 — d~'^^ similarly to the previous 
case, can get 

, , ( / t/d 2d-i 2d— 1 2 \yi°s" 



I - {d + l)e-'^ \{1 - d-ny l-d-^ (l-d-^)2 {i - d-^f j j 

We take x sufficiently large so as to have 1 - d""" > 1 - 10"^ and xd"^ < lO'^. 

If t = 7(i, then, with the above assumptions, about x, we can easily derive that E[pr\ < 
((i~^+244)^iogn. For this case to have E[pr\ < n'^-^ we should have 44^"^+^ < {e'^d/2)^'^-^^ , 
which clearly holds for sufficiently large x. 

If 2xlogd + 1 > 7d, then by Corollary [B] we should have t = 2xlogd + 1. With the 
assumptions we have made for x we get that E\py] < (d~^+2(8x^2|^ + 16))^ Thus, if 
E\pr] < then we should have ((i^^+2(8x^ + 16)) < {e'^d/iy^-'^^ , which clearly holds for 

sufficiently large x. 

The lemma follows by using Lemma iFl 



4 Properties of the algorithm 

We close this work by restating and proving the two theorems that deal with the issues of 
accuracy and efficiency of the sampling algorithm. 

Theorem B If S is a sufficiently large integer constant, then, with probability 1 — 0{n~^'^), 
the sampling algorithm is successful and returns a S- colouring of the input graph G, whose 
distribution is within total variation distance n~^''^^ from the uniform over all the proper S- 
colourings of G. 

Proof: The algorithm is considered successful if the spin-system it considers has the properties 
stated in section [2?H i.e. the following two hold: First, for each iteration i, the induced subgraph, 
of the input graph, that contains Vi and all the vertices that are within graph distance [e log n\ 
from Vi, with e = /^\og(de'^ 12) ' either unicyclic or a tree. According to Lemma lAl this holds 
with probability at least 1 — n~^'^. Second, the spatial mixing property stated in Theorem lAl 
holds. For sufficiently large constant S, the spatial mixing property stated in Theorem VK\ holds 
with probability at least 1 — n~^''^^ . 

Consider that G is the input graph of the algorithm, which is an instance of Gn^d/n ^-iid we 
take S as large as indicated in Theorem |X1 The, the algorithm is successful with probability at 
least 1 — {n~^'^ + 2n~^''^^) = 1 — 0{n~^''^). From now on, we assume that the input G, belongs 
to this set of instances of G^^d/n that the algorithm is successful (which includes almost all 
instances, for sufficiently large S). 

What remains to be shown is the bound for the total variation distance between the distri- 
butions of the colouring that is returned by the algorithm and the uniform over all the proper 
5-colouring of the input graph G. 
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First we show that ([T]) is valid. Consider two systems, i.e. Si = PCS{G,S) and S2 = 
PCS{G,S), each independent of the other. Assume that, in both systems, we fix the colour 
assignments of the vertices in Ai, according to probability measure /i(-). Clearly, we have for 
both systems that Pr[XA, = CiAi)] = fi{XA, = C{Ai)), for C{Ai) G [S]^\ After fixing the 
colour assignments of the vertices in Ai we look at the colour assignments of the vertices at 
graph distance, at least, [elognj from the vertex Vi, in both systems. Let V' be the vertex set 
whose colouring is revealed and let C{V') be the colouring we see in 5*1 and let C'{V') be the 
colouring we see in 5*2. 

By the law of total probability, it is easy, for one, to see that, in Si, the probability for Vi 
to be assigned colouring c, Vc € [S], is equal to fJ-{Xy. = c\Ci{V')). Similarly in 5*2, the the 
probability for Vi to be assigned colouring c, Vc G [S], is given by the measure fJ-{Xy^ = c\Ci{V')). 

Note that in the above we have fixed the colourings of the vertices according to //(•), but we 
do not have seen what are the actual colourings. We see, only, the colourings of the vertices in 

v. 

By Theorem |A] and the discussion at the end of the section 11.21 we have that the total 
variation distance between the probability measures of interest, is bounded as follows: 

dTv{KXv,\Ci{V')),fi{X,^\C2{V'))) < SD{vi, LelognJ) < n-^-^s (15) 

Which shows that ([T]) is valid. Thus, the asymptotic independence between the colouring of 
the vertex Vi and the colouring of the vertex set V' remains when we colour the vertices in Ai 
according to distribution ^(•). 

Consider the following coupling of our algorithm and an ideal algorithm that gives a per- 
fect uniform sample by colouring vertices one by one in some way. At each repetition, both 
algorithms assign a colour to some (the same) vertex in Gn^d/n- Consider a specific step of the 
coupling where the vertex v is to be coloured. By Theorem \K\ and ([T]), we can have a sufficiently 
large S such that, conditioning on the fact that all vertices until now are identically coloured by 
the two algorithms, the probability for v to have a different colour assignment in the coupling 
is at most n~^'^^. Thus, the probability for the coupling to end with no disagreement is at least 
(1 — n"^-^^)"- > 1 — n~°-^^. The theorem follows. o 

Theorem C The time complexity of the sampling algorithm is w.h.p. asymptotically upper 
bounded by 0{n'^), where n is the number of vertices of the input graph. 

Proof: First, we note that the algorithm will return failure if any of the graphs B^^ is neither 
unicyclic, nor a tree. The number of step the algorithm needs, in the case of failure, is at most 
equal to the number of steps that are need in the case of nonfail. Thus, time complexity of the 
nonfailing execution is an upper bound of the time complexity of the algorithm. 

The algorithm needs 0{n) steps to create the graph By., at the z-th iteration of its for loop. 
The graph 5^,. can be created by using any traversal algorithm, e.g. depth first search. This 
time bound follows by the fact that the number of vertices and the number of vertices in are 
upper bounded by the number of vertices and the edges of the input graph. Using the Chernoff 
bounds (see [6]) it is direct to show that w.h.p. the number of edges in an instance of Gn^d/n is 
0{n). 

Implementing a colouring of Vi according to fi{Xy^\C{Ai fl V^)), is equivalent to generating a 
random list colouring of B^. and keeping only the colour assignment of Vi from this colouring. In 
the list colouring problem every vertex u €Vi has a set L{u) of valid colours, where L{u) C [S] 
and u can only receive a colour in L{u). As argued in [4j, for a tree on / vertices we can exactly 
compute the number of A;-colourings in time I ■ k. Therefore we can also generate a random 
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list colouring of the tree. Also, for a unicyclic component we can simply consider all the < 
colourings of the endpoints of the extra edge and then recurse the remaining tree. 

I.e. the time we need to colour Vi according to fj,{Xy^\C{Ai n V^)), is at most 0(n). The 
theorem follows by noting that we need to colour n vertices. 

o 
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